A Latent Class Modeling Approach for Differentially Private Synthetic Data for Contingency Tables

Abstract

We present an approach to construct differentially private synthetic data for contingency tables. The algorithm achieves privacy by adding noise to selected summary counts, e.g., two-way margins of the table, via the Geometric mechanism. We posit an underlying latent class model, estimate its parameters based on the noisy counts, and generate synthetic data using the estimated model. This allows the agency to create multiple imputations with no additional privacy loss, thereby facilitating estimation of uncertainty in downstream analyses. We illustrate the approach using a subset of the 2016 American Community Survey Public Use Microdata Sets.
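The noise-addition step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the sensitivity of 1 (which assumes neighboring datasets differ by adding or removing one record), and the choice to spend the whole privacy parameter epsilon on a single margin are all assumptions made for illustration. The two-sided geometric noise is drawn as the difference of two independent geometric variables with success probability 1 - exp(-epsilon / sensitivity).

```python
import numpy as np


def two_sided_geometric(rng, epsilon, sensitivity, size):
    """Draw integer noise Z with P(Z = z) proportional to alpha**abs(z).

    alpha = exp(-epsilon / sensitivity). The difference of two i.i.d.
    geometric draws has exactly this two-sided geometric distribution.
    """
    alpha = np.exp(-epsilon / sensitivity)
    p = 1.0 - alpha
    return rng.geometric(p, size=size) - rng.geometric(p, size=size)


def noisy_two_way_margin(data, i, j, levels_i, levels_j, epsilon, rng,
                         sensitivity=1):
    """Return the (i, j) two-way margin of `data` plus geometric noise.

    `data` is an integer array of shape (n_records, n_variables) whose
    column i takes values in 0..levels_i-1 (likewise for column j).
    sensitivity=1 is an assumption (add/remove-one-record neighbors).
    """
    counts = np.zeros((levels_i, levels_j), dtype=np.int64)
    # Tally each record into its (value_i, value_j) cell.
    np.add.at(counts, (data[:, i], data[:, j]), 1)
    noise = two_sided_geometric(rng, epsilon, sensitivity, counts.shape)
    return counts + noise


# Hypothetical usage on simulated categorical data (not the ACS subset).
rng = np.random.default_rng(0)
data = rng.integers(0, 3, size=(100, 4))
noisy = noisy_two_way_margin(data, 0, 1, 3, 3, epsilon=0.5, rng=rng)
```

Noisy cells can be negative and need not sum to the number of records. In the approach the abstract describes, such noisy counts feed the latent class model estimation rather than being released as a table. If several margins are perturbed, the total privacy budget would have to be divided among them, which this sketch does not show.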

Related resources

A New Approach to Credibility Premium for Zero-Inflated Poisson Models for Panel Data

The main aim of this research is to obtain and compare credibility premiums in count models with unreported claims for longitudinal data. Predictive premiums are computed under the squared-error and exponential loss functions and compared with each other. The desire to earn a bonus is one of the main reasons accidents go unreported, and policyholders often refrain from reporting low-cost accidents in order to keep their discounts. In this research ...

Releasing Private Contingency Tables

Statistical agencies such as the US Census Bureau routinely release aggregate statistics about the general population. These statistics are often reported in the form of contingency tables. A 2-dimensional contingency table is an (m + 1) × (n + 1) matrix over two attributes that are binned into m rows and n columns. For instance, the attributes could be Age binned into buckets of length 10 and ...

A Unified Approach for the Multivariate Analysis of Contingency Tables

We present a unified approach to describing and linking several methods for representing categorical data in a contingency table. These methods include correspondence analysis, Hellinger distance analysis, the log-ratio alternative (which is appropriate for compositional data), and non-symmetrical correspondence analysis. We also present two solutions that work with cumulative frequencies.

Relational models for contingency tables

The paper considers general multiplicative models for complete and incomplete contingency tables that generalize log-linear and several other models and are entirely coordinate free. Sufficient conditions for the existence of maximum likelihood estimates under these models are given, and it is shown that the usual equivalence between multinomial and Poisson likelihoods holds if and only if an ov...

Journal

Journal title: The Journal of Privacy and Confidentiality

Year: 2022

ISSN: 2575-8527

DOI: https://doi.org/10.29012/jpc.768